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Abstract — Q In this paper we present a thorough analysis of non 
binary LDPC codes over the binary erasure channel. First, the 
decoding of non binary LDPC codes is investigated. The proposed 
algorithm performs "on-the fly" decoding, i.e. it starts decoding 
as soon as the first symbols are received, which generalizes the 
erasure decoding of binary LDPC codes. Next, we evaluate the 
asymptotical performance of ensembles of non binary LDPC 
codes, by using the density evolution method. Density evolution 
equations are derived by taking into consideration both the 
irregularity of the bipartite graph and the probability distribution 
of the graph edge labels. Finally, infinite-length performance of 
some ensembles of non binary LDPC codes for different edge 
label distributions are shown. 

I. Introduction 

Data loss recovery - for instance, for content distribution 
applications or for distributed storage systems - is widely 
addressed using FEC (Forward Error Correction) techniques 
based on error correcting codes. These codes are dealing with 
erasure channels, i.e. a channel that either transmits the data 
unit correctly (without error) or erases it completely. In the 
case of content distribution applications, the potential physical 
layer CRC, or physical layer FEC codes, or transport level 
UDP checksums, may lead a receiver to discard erroneous 
data units. For distributed storage systems, data loss may be 
due to broken servers, Denial-of-Service (DoS) attacks, etc. 

The performance of error correcting codes over erasure 
channels can be analyzed precisely, and a flurry of research 
papers have already addressed this issue. Low-density parity- 
check (LDPC) codes [1], [2] with iterative decoding [3] proved 
to perform very close to the channel capacity with reasonable 
complexity [4] [5]. Moreover, "rateless" codes that are capable 
of generating an infinite sequence of repair symbols were pro- 
posed in [6] [7]. LDPC codes were generalized by Tanner [8], 
by introducing the sparse graph representation and replacing 
the Single Parity Check (SPC) constraint nodes with error 
correcting block codes. Nowadays, these codes are known as 
GLDPC codes and were recently investigated for the BEC [9], 
[10], [11]. Over the past few years there has been an increased 
interest in non binary LDPC codes due to their enhanced 
correction capacity, but at this time only few works are dealing 
with the BEC [12], [13]. In this paper we give a thorough 
analysis of non binary LDPC codes over the BEC. The paper is 
organized as follows: in Section|Il]we review some background 
on the construction of non binary LDPC codes. The decoding 
of non binary LDPC codes over the BEC is addressed in 
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Section [TTT] In Section |IV] we derive the density evolution 
equations taking into consideration both the irregularity of the 
bipartite graph and the probability distribution of the graph 
edge labels. Thresholds of some ensembles of non binary 
LDPC codes for different edge label distributions are shown 
in Section IVl 

II. Non binary LDPC codes 

We denote by F g the Galois field with q elements. For 
practical reasons, we will assume that q is a power of 2, even 
if this condition is not always necessary. Thus, we set q = 2 P , 
where p is the vector space dimension of ¥ q over F2 (each 
time we refer to F g as a vector space, we consider its F2- 
vector space structure). We fix once for all an isomorphism of 
F2-vector spaces: 

n ^ f, (i) 

and we say that (bo, ... , <E F?J are the constituent bits 

of the symbol s G ¥ q , if they correspond to each other by the 
above isomorphism. 

Let L be a multiplicative group acting on the vector space 
F g . For instance, we may have: 

• L = F*, acting on ¥ q via the internal field multiplication; 

• L = M*(F2), the multiplicative group of invertible pxp 
matrices, acting on F g via the isomorphism F?J ¥ q 
from ([]])■ 

The action of L on ¥ q will always be denoted multiplicatively, 
that is: 

LxF,^F,: (h, s) i-> hs (2) 
For any matrix H G Mm,at(L) one can define a code: 
C = kei(H) (3) 

JV 

= {(si, . . . , s N ) I ^ h m.nS„ = 0, Vm = 1, . . . , M} 

n=l 

If L = F* acting on ¥ q via the internal field multiplication, 
then C is a ¥ q -linear code, but this does not happen for general 
L. 

The Tanner graph associated with the code C, denoted by H, 
consists of N variable nodes and M check nodes representing 
the N columns and the M lines of the matrix H. A variable 
node and a check node are connected by an edge if the 
corresponding element of matrix H is not zero. Each edge 
of the graph is labeled by the corresponding non zero element 
of H. Thus, from now on, we refer to the elements of L as 
labels. We also denote H(n) the set of check nodes connected 



to a given variable node n £ {1,2,..., N}, and by H(m) 
the set of variable nodes connected to a given check node 
me {1,2,..., A/}. 

A. The binary image of a non binary code 

Any sequence (si,...,sjv) S F;^ may be mapped into 
a binary sequence of length Np via the isomorphism of 
([TJ. The binary sequences associated with the codewords 
(si,...,sn) € C constitute a linear binary code Cbin C 
F^ p , which is called the binary image of C. Moreover, the 
action (O of the multiplicative label group L on F q induces 
a group morphism from L into the group of vector space 
endomorphisms Cr 2 (F q ,F q ), and identifying ¥ q and F?? via 
(HJ, we get a morphism: 



L -► £ F2 (F g ,F g ) ^ £ F2 (Ff,F^) = M P (F 2 ) 



(4) 



Replacing each coefficient of the matrix H 6 Mm,at(L) with 
its image under the above morphism, we obtain a binary matrix 
f/bin G Ma/ p jv p (F2), which is simply the parity check matrix 
of the binary code Cbin- While the encoding may be performed 
using either the non binary code or its binary image, the 
iterative decoding of a non binary code on its binary image 
generally yields very poor performance. 

III. Decoding non binary LDPC codes 

For general channels, several decoding algorithms for non 
binary LDPC codes were proposed in the literature [14], [15], 
[16]. Because of the BEC specificity, these algorithms are 
all equivalent over the BEC, and they can be described in 
a slightly different manner, as presented below. 

A. Decoding over the BEC 

In this section we assume that a non binary LDPC code is 
used over BEC(e) - the binary erasure channel with erasure 
probability e. Thus, the length N sequence of encoded F g - 
symbols is mapped into the corresponding binary sequence of 
length Np, which is transmitted over the BEC, each bit from 
the binary sequence being erased with probability e. We say 
that a F g -symbol is : 

* received, if all of its constituent bits are received; 

« erased, if all of its constituent bits are erased by the 
channel; 

* partially erased, if some of its constituent bits are erased 
by the channel and some others are received. 

At the receiver part, the received bits are used to reconstruct 
the corresponding F 9 -symbols. The reconstruction may be 
complete, partial, or lacking, according to whenever the cor- 
responding symbol is received, partially erased, or erased. 

Let n be a variable node of the Tanner graph and s G F 9 . We 
say that the symbol s is eligible at the variable node n, if the 
probability of the n th transmitted symbol being s is non zero. 
Tacking into consideration the channel output, the a priori set 
of eligible symbols, denoted by <8 n , consists of the symbols 
that fit with the received constituent bits (if any) of the n" 1 
transmitted symbol. Thus : 

* <§ n = Fq, if the symbol is erased, 



< 4£ F q , if the symbol is partially erased, 

« card(<f„) = 1, if the symbol is received. 

These sets constitute the a priori information of the decoder. 
They are iteratively updated by exchanging extrinsic messages 
between variable and check nodes in the graph. Each message 
is a subset of F q , representing a set of eligible symbols. 
Precisely, the message sent by a graph node on an outgoing 
edge is a set of eligible symbols, which is computed according 
to messages received by the same node on the incoming edges. 
We use the following notation: 

• s^m.n the set of eligible symbols sent by the variable node 
n to the check node m; 

• &m,n the set of eligible symbols sent by the check node 
m to the variable node n. 

Finally, if y, y u y 2 C F q and h eh we define: 

h,y = {hs\se y} 
y x +y 2 = { Sl + S2 1 si e y u s 2 e y 2 } 

The iterative decoder for the BEC can be expressed as 
follows: 

Initialization step 

• variable-to-check messages initialization 

Iteration step 

• check-to-variable messages 

n'£H(m)\{n} 

• variable-to-check messages 

\m' £H{n)\{m} 

• a posteriori sets of eligible symbols 



n 



The decoder stops when all the a posteriori sets of eligible 
symbols § n are of cardinality 1, or when a maximum number 
of iterations is reached. It is important to note that any set 
of eligible symbols (S ni ^ m ^ n ,SS m ^ n , or is a F2-affine 
sub-space of F q , in particular, its cardinal is a power of 2. 

B. Minimum-delay decoding 

In this section we propose a minimum-delay decoding 
algorithm over the BEC, in the sense that the decoding starts 
since the reception of the first bits, which is suited for Upper- 
Layer Forward Error Correction (UL-FEC). 

The minimum-delay decoding of non binary codes consists 
of removing symbols from the sets of eligible symbols: 

• initialize S n = F q , n — 1, . . . , N 

• each time a new bit is received, identify the variable node 
n of which the received bit is a constituent bit, and then: 

A{n): remove symbols from <§ n whose corresponding 
constituent bit is different from the received bit 



B(n): process the check nodes m 6 H(n), then update 
the sets of eligible symbols $ n * <— S n i, for each 
n' e H(m) \ {n} 
C(n): For each of the above n's, if by updating S n i its 
cardinal has been reduced, go to B(n «— n'). 
The decoder stops when all the sets <S n are of cardinality 1. 

1) Decoding inefficiency: It follows that the minimum- 
delay decoding is actually an on-the-fly implementation of 
the previous iterative decoding. A performance metric that is 
often associated with on-the-fly decoding is the decoding in- 
efficiency, defined as the ratio between the number of received 
bits before decoding stops and the number of information bits. 
Let K\,i n be the binary dimension of the code, and ^received be 
the number of received bits before decoding stops. Then the 
inefficiency p is defined as: 

-^received 



(5) 



The expectation of the random variable p, denoted by p m , is 
called average inefficiency. In practice p m can be estimated 
by Monte-Carlo simulation. 

The average inefficiency of the on-the-fly decoding can 
be related to the failure probability of the iterative decoding 
(sectionHTB. Precisely, for any e G [0, 1], letp(e) be the failure 
probability of the iterative decoding assuming that e is the 
channel erasure probability. Assuming that the function p is 
integrable on [0, 1], we have: 



1 



p(e) de 



(6) 



IV. Density evolution 



Density evolution for non binary LDPC codes over the BEC 
was already derived in [12], assuming an uniform distribution 
on the edge labels. In loc. cit., the authors suggest that the 
distribution of the edge labels represents a degree of freedom 
that should be integrated to our understanding of capacity 
approaching iterative coding schemes. To do so, we derive 
the density evolution of non binary codes tacking into consid- 
eration the variable and check nodes degree distributions, but 
also the probability distribution of the edge labels. We use the 
following notation: 

• Ad is the fraction of edges connected to variable nodes 

d v 

of degree d, X(X) = ^^ArfAT d_1 is the polynomial of 

d=l 

variable node degree distribution; 

• pa is the fraction of edges connected to check nodes of 

degree d, p{X) = pdX d ~ x is the polynomial of check 

d=l 

node degree distribution; 

• / : L — > [0, 1] the probability distribution function 
defined by f(h) = fraction of edges with label h G L. 
By extending the notation, for a given sequence h = 

{h lt ..., hi) we define /(h) = [| f(ht). 

i=l 



Without losing generality, we may assume that the all-zero 
codeword is transmitted. Thus, any set of eligible symbols 
(S'n, srf m<n , 3§m,n> or $n) is a F2-linear sub-spaces of ¥ q . 
Table U gives the list of the possible values of the a priori sets 
of eligible symbols S n for the case of a Fs-code, according 
to the received binary sequenc^H 

TABLE I 

Possible values of the a priori sets of eligible symbols 



received bits* 






XXX 


F 8 




Oxx 


{0,1,2,3} 


e-(l-e) 


xOx 


{0,1,4,5} 


6^(1-6) 


xxO 


{0,2,4,6} 


e-(l-e) 


xOO 


{0,4} 


e(l-ey 


0x0 


{0,2} 


e(l-ey 


OOx 


{0,1} 


6(1-6)=- 


000 


{0} 


(1 - e)° 



* Symbol x denotes an erased bit 

Let Gr(F 9 ) be the Grassmannian of ¥ q , that is the set of all 
F2-linear subspaces of ¥ q . For V G Gr(F g ), we note: 



Pt{V) 
Qi(V) 



V) 

■ V) 



(7) 
(8) 



where superscript (£) is used to denote sets of eligible symbols 
computed at the £ th iteration. Thus, the decoding is success- 
fully if and only if: 



lim P,({0}) 

— »+oo 



1 



(9) 



In order to simplify the notation, we define: 
. For any V G Gr(F g ): 

7 (y) : =p (y) = Pr(4 l = y) 

4° : = {V = (Vi, . . . , Vj) I J2 V i = y } C Gr(¥ q Y 

i=l 

I 

■= {(V ,V) = {Vq,V U -..,Vi) I f]Vi 
C Gr(F 9 ) /+1 

,hi) G L J and V = (Vi,. 



r(I) 



For any h 

Gr(F 9 )': 



Vi ) G 



For any V = ( V\ , 



r 1 ), h-v-=(hiVi,. 

, Vi) G Gr(F g ) 7 : 



. , hiVi) 



i=l i=l 

Let (m, n) be an edge of the tanner graph. Assume that 
H(m) = {n, rix, . . . , rid-i}, where d is the degree of the 
check node rn. To simplify the notation, we set hi — h m ^ ni , 
the non zero label of the edge (m, rij), for i = 1, . . . , d — 1. 

'Here we identify Fg = {0, 1,2,..., 7}, and the constituent bits of a given 
symbol correspond to the binary decomposition. 



The probability of ^mtn ^ being equal to V, conditioned on 
h = (hi, . . . , hd-i), may be computed as: 

Pr(^+ 1} = V | h) = (nW 1 ^)) (10) 

Averaging over all possible label sequences h we get: 

= E /(*)• E p ^ h-1 -^) 

Averaging over all possible check node degrees d, we obtain: 

d c 

•>(.d-l), 

I Hd " 4 

d=l 

Now, consider an edge (n, m) of the Tanner graph, and 
let the variable node n be of degree d and TL(n) = 
{m, mi, ■ ■ ■ , m<l-i}- To simplify notation, we set hi = h mi , n , 
the non zero label of the edge (n, m,), for i = 1, . . . , d — 1. 
The probability of &/Jn~^ being equal to V, conditioned on 
h = (hi, . . . , hd-i), may be computed as: 



Q i+ i(V) = J^( Pd -Q{ d + - 1 \v) 



(12) 



Pr 



(d-l 
, ..... i=1 



(13) 

Again, by averaging over all possible label sequences h, it 
follows that: 



P^\V) := Pr«+ 1} = ^) 

= E /( h )- E 7(Vo)Qm(h-Y) 



her 



(14) 



(v ,v)ez^ 



Finally, averaging over all possible variable node degrees d, 
we obtain: 

dv 

P £+ i(V) = J2(\ d .pg- 1) (V)) (15) 
d=l 

Proposition 1: Let V, W G Gr(F g ) and heL such that 
W = W. Then: 

Qiti 1) W = E \f(h-h)- ^^(h^.V)] (16) 

P& 1J (W) = 

E /(ft-h). ^ 7W)0/+i(h-Y)] (17) 



heL d 



(v ,v)eii,- 



where /i • (hi, ... , hd—i) — (hhi, . . . , hhd-i)- In particular, if 
/ is the uniform distribution, then Q^ 1 1 \\V) — Qf + i\v) 

^p^- 1 \w) = p^- 1 \v). 



We say that V and W are conjugate if there exists del 
such that W = hV and denote by Gr(F g )/L the quotient set 
of conjugation classes. 

Corollary 2: Assume that / is the uniform distribution and 
let V £ Gr(¥ q ). Then Q e (V) and P e (V) depend only on the 
conjugation class of V in Gr(F g )/L. 

Corollary 3: Assume that / is the uniform distribution and 
that L = M*(F2), the multiplicative group of invertible p x p 
matrices, acting on ¥ q via the isomorphism F?? ^> ¥ q from 
CD. Let V e Gr(Fg). Then Qi(V) and P e (V) depend only on 
the dimension of the vector space V. 

The above corollaries may be used to simplify the density 
evolution formulas, assuming a uniform distribution of the 
graph edge labels. For instance, if L = M*(F 2 ), one can 
derive the same formulas as in [12]. 

V. Thresholds 

We denote by Er q j j (X, p, /) the ensemble of LDPC codes 
over Fg, with labels group L, distribution degree polyno- 
mials A and p, and probability distribution of edge labels 
/. Whenever the Galois group ¥ q and the labels group L 
are subunderstood, we will simply use E(X,p,f). We also 
denote by p± ¥q x(X, p, f) (or simply p th (A, p, /)) the threshold 
probability of the above ensemble, that is (see also ©): 

Pth(A, p, f) = max{e | lim P e ({0}) = 1 on BEC(e)} (18) 

By fixing the polynomials of degree distribution A and p, 
the probability threshold p± may be seen as a function of 
the probability distribution /. This is illustrated in Fig. [TJ and 
Fig. The Galois field is F 4 and the labels group L = F|, 
acting of F4 by the internal field multiplication. The horizontal 
axes /(l) and /(2) represent the probabilities of edge labels 
being 1 and 2, respectively. Thus, the probability of edge labels 
being 3 is given by /(3) = 1 — /(l) — /(2). We drawn the 
surface representing p± as function of /(l) and /(2). The 
top of the surface is plotted in red, the middle in green, and 
the bottom in blue. The two figures correspond to two couples 
(A, p) of degree distributions that were also considered in [12]. 
In Fig. [TJwe fix A = X and p = X 2 . The maximum p t h is 
obtained for the uniform distribution /(l) = /(2) = /(3) = 
1/3 and its value is equal to 0.5772. The minimum p± = 0.5 
is obtained for the three distributions concentrated in a single 
label (such codes are equivalent to binary codes). In Fig. [2] we 
fix X(X) = X 2 and p(X) — X 3 . For the uniform distribution 
/(l) = /(2) = /(3) = 1/3, the threshold p th = 0.6348. 
The minimum p t h = 0.6346. The maximum — 0.6474 is 
obtained for the three distributions concentrated in one single 
label. 

These two examples highlight a more general phenomenon 
that we observed for other ensembles of codes, as shown for 
instance in Fig. [3] For a given Galois field ¥ q , and given 
polynomials A, and p, it is possible to find a probability 
distribution / of edge labels, such that: 

> edge labels are equal to 1 with high probability (meaning 
that f(l) is close to 1) 

• Pth F „F* ( A > ft /) « maxpthF, ,f« (-V ft /) 



1 



Fig. 1, Probability threshold of the ensemble i?p 4 F « (A = X,p = X 2 ,f) 
as function of labels probability distribution /. 
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Fig. 3. Probability thresholds of the ensembles £( p (A, p, /) and 
£f 4 ,f* (A,p, /) for A = 0.5X + 0.5X 4 , p = X 5 , and different labels 
probability distributions /. 

For instance, considering the ensemble of codes over Fs 
from Fig. U if / is defined by /(l) = 0.8, /(7) = 0.2, and 
f(i) = for 1 < i < 7, then Pth F? , F |(A, p, f) = 0.4483. In 
this case only few Galois field multiplications are needed, and 
the decoder complexity is considerably reduced. 

VI. Conclusions 

In this paper we investigated the decoding of non binary 
LDPC codes over the BEC, and we introduced a minimum- 
delay decoding suited for UL-FEC. We also derived the 
density evolution equations taking into consideration both 
the irregularity of the bipartite graph of the code and the 
probability distribution of the graph edge labels, giving a 
thorough understanding of the asymptotical behavior of en- 
sembles of non binary LDPC codes. A non-uniform probability 
distribution of the edge labels might improve the decoder 



Fig. 2. Probability threshold of the ensemble Ef 4 ,fJ (A = X 2 , p = X 3 , /) 
as function of labels probability distribution /. 

performance, but the most important advantage is that the 
decoder complexity can be significantly reduced. The design 
of capacity approaching non binary LDPC codes will be 
addressed in future works. 
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